Dataset statistics
| Number of variables | 9 |
|---|---|
| Number of observations | 1030 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 11 |
| Duplicate rows (%) | 1.1% |
| Total size in memory | 72.5 KiB |
| Average record size in memory | 72.1 B |
Variable types
| Numeric | 9 |
|---|---|
| Dataset has 11 (1.1%) duplicate rows | Duplicates |
| Water (component 4)(kg in a m^3 mixture) is highly correlated with Superplasticizer (component 5)(kg in a m^3 mixture) | High correlation |
| Superplasticizer (component 5)(kg in a m^3 mixture) is highly correlated with Water (component 4)(kg in a m^3 mixture) | High correlation |
| Age (day) is highly correlated with Concrete compressive strength(MPa, megapascals) | High correlation |
| Concrete compressive strength(MPa, megapascals) is highly correlated with Age (day) | High correlation |
| Cement (component 1)(kg in a m^3 mixture) is highly correlated with Blast Furnace Slag (component 2)(kg in a m^3 mixture) and 6 other fields | High correlation |
| Blast Furnace Slag (component 2)(kg in a m^3 mixture) is highly correlated with Cement (component 1)(kg in a m^3 mixture) and 5 other fields | High correlation |
| Fly Ash (component 3)(kg in a m^3 mixture) is highly correlated with Cement (component 1)(kg in a m^3 mixture) and 5 other fields | High correlation |
| Water (component 4)(kg in a m^3 mixture) is highly correlated with Cement (component 1)(kg in a m^3 mixture) and 5 other fields | High correlation |
| Superplasticizer (component 5)(kg in a m^3 mixture) is highly correlated with Cement (component 1)(kg in a m^3 mixture) and 5 other fields | High correlation |
| Coarse Aggregate (component 6)(kg in a m^3 mixture) is highly correlated with Cement (component 1)(kg in a m^3 mixture) and 5 other fields | High correlation |
| Fine Aggregate (component 7)(kg in a m^3 mixture) is highly correlated with Cement (component 1)(kg in a m^3 mixture) and 5 other fields | High correlation |
| Concrete compressive strength(MPa, megapascals) is highly correlated with Cement (component 1)(kg in a m^3 mixture) | High correlation |
| Blast Furnace Slag (component 2)(kg in a m^3 mixture) has 466 (45.2%) zeros | Zeros |
| Fly Ash (component 3)(kg in a m^3 mixture) has 566 (55.0%) zeros | Zeros |
| Superplasticizer (component 5)(kg in a m^3 mixture) has 379 (36.8%) zeros | Zeros |
Reproduction
| Analysis started | 2023-07-13 04:28:19.018554 |
|---|---|
| Analysis finished | 2023-07-13 04:28:37.728760 |
| Duration | 18.71 seconds |
| Software version | pandas-profiling v3.1.0 |
| Download configuration | config.json |
Cement (component 1)(kg in a m^3 mixture)
| Distinct | 280 |
|---|---|
| Distinct (%) | 27.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 281.1656311 |
| Minimum | 102 |
|---|---|
| Maximum | 540 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.2 KiB |
Quantile statistics
| Minimum | 102 |
|---|---|
| 5-th percentile | 143.745 |
| Q1 | 192.375 |
| median | 272.9 |
| Q3 | 350 |
| 95-th percentile | 480 |
| Maximum | 540 |
| Range | 438 |
| Interquartile range (IQR) | 157.625 |
Descriptive statistics
| Standard deviation | 104.5071416 |
|---|---|
| Coefficient of variation (CV) | 0.3716924478 |
| Kurtosis | -0.5206632839 |
| Mean | 281.1656311 |
| Median Absolute Deviation (MAD) | 79.4 |
| Skewness | 0.5095174326 |
| Sum | 289600.6 |
| Variance | 10921.74265 |
| Monotonicity | Not monotonic |
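Several of the figures in these tables are derived from others, so they can be cross-checked. The following is a minimal sketch that recomputes the range, IQR, coefficient of variation, variance, and mean from the values reported above for this variable. The variable names are illustrative and not part of the report, and the observation count of 1030 is taken from the dataset statistics.

```python
# Values copied from the tables above for this variable.
minimum, maximum = 102.0, 540.0
q1, q3 = 192.375, 350.0
mean = 281.1656311
std = 104.5071416
total = 289600.6          # the "Sum" row
n_observations = 1030     # from the dataset statistics

value_range = maximum - minimum          # reported Range: 438
iqr = q3 - q1                            # reported IQR: 157.625
cv = std / mean                          # reported CV: 0.3716924478
variance = std ** 2                      # reported Variance: 10921.74265
mean_from_sum = total / n_observations   # reported Mean: 281.1656311

print(value_range, iqr, round(cv, 6), round(variance, 2), round(mean_from_sum, 4))
```

Small differences in the last digits are expected because the inputs are themselves rounded in the report.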
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 425 | 20 | 1.9% |
| 362.6 | 20 | 1.9% |
| 251.37 | 15 | 1.5% |
| 446 | 14 | 1.4% |
| 310 | 14 | 1.4% |
| 331 | 13 | 1.3% |
| 250 | 13 | 1.3% |
| 475 | 13 | 1.3% |
| 387 | 12 | 1.2% |
| 349 | 12 | 1.2% |
| Other values (270) | 884 |
| Value | Count | Frequency (%) |
| 102 | 4 | |
| 108.3 | 4 | |
| 116 | 4 | |
| 122.6 | 4 | |
| 132 | 2 | 0.2% |
| 133 | 5 | |
| 133.1 | 1 | 0.1% |
| 134.7 | 1 | 0.1% |
| 135 | 2 | 0.2% |
| 135.7 | 2 | 0.2% |
| Value | Count | Frequency (%) |
| 540 | 9 | |
| 531.3 | 5 | |
| 528 | 1 | 0.1% |
| 525 | 7 | |
| 522 | 2 | 0.2% |
| 520 | 2 | 0.2% |
| 516 | 2 | 0.2% |
| 505 | 1 | 0.1% |
| 500.1 | 1 | 0.1% |
| 500 | 10 |
Blast Furnace Slag (component 2)(kg in a m^3 mixture)
| Distinct | 187 |
|---|---|
| Distinct (%) | 18.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 73.89548544 |
| Minimum | 0 |
|---|---|
| Maximum | 359.4 |
| Zeros | 466 |
| Zeros (%) | 45.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 22 |
| Q3 | 142.95 |
| 95-th percentile | 236 |
| Maximum | 359.4 |
| Range | 359.4 |
| Interquartile range (IQR) | 142.95 |
Descriptive statistics
| Standard deviation | 86.27910364 |
|---|---|
| Coefficient of variation (CV) | 1.167582879 |
| Kurtosis | -0.5081392049 |
| Mean | 73.89548544 |
| Median Absolute Deviation (MAD) | 22 |
| Skewness | 0.8007373534 |
| Sum | 76112.35 |
| Variance | 7444.083725 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0 | 466 | |
| 189 | 30 | 2.9% |
| 106.3 | 20 | 1.9% |
| 24 | 14 | 1.4% |
| 20 | 12 | 1.2% |
| 145 | 11 | 1.1% |
| 19 | 10 | 1.0% |
| 22 | 8 | 0.8% |
| 26 | 8 | 0.8% |
| 190 | 7 | 0.7% |
| Other values (177) | 444 |
| Value | Count | Frequency (%) |
| 0 | 466 | |
| 0.02 | 5 | 0.5% |
| 11 | 4 | 0.4% |
| 13.61 | 5 | 0.5% |
| 15 | 5 | 0.5% |
| 17.2 | 1 | 0.1% |
| 17.5 | 1 | 0.1% |
| 17.6 | 1 | 0.1% |
| 19 | 10 | 1.0% |
| 20 | 12 | 1.2% |
| Value | Count | Frequency (%) |
| 359.4 | 2 | 0.2% |
| 342.1 | 2 | 0.2% |
| 316.1 | 2 | 0.2% |
| 305.3 | 4 | |
| 290.2 | 2 | 0.2% |
| 288 | 4 | |
| 282.8 | 4 | |
| 272.8 | 2 | 0.2% |
| 262.2 | 5 | |
| 260 | 1 | 0.1% |
Fly Ash (component 3)(kg in a m^3 mixture)
| Distinct | 163 |
|---|---|
| Distinct (%) | 15.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 54.18713592 |
| Minimum | 0 |
|---|---|
| Maximum | 200.1 |
| Zeros | 566 |
| Zeros (%) | 55.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 118.27 |
| 95-th percentile | 167.0055 |
| Maximum | 200.1 |
| Range | 200.1 |
| Interquartile range (IQR) | 118.27 |
Descriptive statistics
| Standard deviation | 63.99646938 |
|---|---|
| Coefficient of variation (CV) | 1.181026978 |
| Kurtosis | -1.328504785 |
| Mean | 54.18713592 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.5374451101 |
| Sum | 55812.75 |
| Variance | 4095.548093 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0 | 566 | |
| 141 | 16 | 1.6% |
| 118.27 | 15 | 1.5% |
| 79 | 14 | 1.4% |
| 94 | 13 | 1.3% |
| 174.24 | 10 | 1.0% |
| 98.75 | 10 | 1.0% |
| 95.69 | 10 | 1.0% |
| 125.18 | 10 | 1.0% |
| 121.62 | 10 | 1.0% |
| Other values (153) | 356 |
| Value | Count | Frequency (%) |
| 0 | 566 | |
| 24.46 | 5 | 0.5% |
| 24.51 | 5 | 0.5% |
| 24.52 | 5 | 0.5% |
| 59 | 1 | 0.1% |
| 60 | 1 | 0.1% |
| 71 | 1 | 0.1% |
| 71.5 | 1 | 0.1% |
| 75.6 | 1 | 0.1% |
| 76 | 1 | 0.1% |
| Value | Count | Frequency (%) |
| 200.1 | 1 | 0.1% |
| 200 | 1 | 0.1% |
| 195 | 3 | |
| 194.9 | 1 | 0.1% |
| 194 | 1 | 0.1% |
| 193 | 1 | 0.1% |
| 190 | 1 | 0.1% |
| 187 | 1 | 0.1% |
| 185.3 | 1 | 0.1% |
| 185 | 2 |
Water (component 4)(kg in a m^3 mixture)
Real number (ℝ≥0)
HIGH CORRELATION
| Distinct | 205 |
|---|---|
| Distinct (%) | 19.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 181.5663592 |
| Minimum | 121.75 |
|---|---|
| Maximum | 247 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.2 KiB |
Quantile statistics
| Minimum | 121.75 |
|---|---|
| 5-th percentile | 146.14 |
| Q1 | 164.9 |
| median | 185 |
| Q3 | 192 |
| 95-th percentile | 228 |
| Maximum | 247 |
| Range | 125.25 |
| Interquartile range (IQR) | 27.1 |
Descriptive statistics
| Standard deviation | 21.35556707 |
|---|---|
| Coefficient of variation (CV) | 0.1176185234 |
| Kurtosis | 0.1226763387 |
| Mean | 181.5663592 |
| Median Absolute Deviation (MAD) | 13 |
| Skewness | 0.07432397542 |
| Sum | 187013.35 |
| Variance | 456.0602447 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 192 | 118 | 11.5% |
| 228 | 54 | 5.2% |
| 185.7 | 46 | 4.5% |
| 203.5 | 36 | 3.5% |
| 186 | 28 | 2.7% |
| 162 | 20 | 1.9% |
| 164.9 | 20 | 1.9% |
| 185 | 15 | 1.5% |
| 153.5 | 15 | 1.5% |
| 200 | 14 | 1.4% |
| Other values (195) | 664 |
| Value | Count | Frequency (%) |
| 121.75 | 5 | |
| 126.6 | 5 | |
| 127 | 1 | 0.1% |
| 127.3 | 1 | 0.1% |
| 137.8 | 5 | |
| 140 | 1 | 0.1% |
| 140.75 | 5 | |
| 141.8 | 5 | |
| 142 | 1 | 0.1% |
| 143.3 | 5 |
| Value | Count | Frequency (%) |
| 247 | 1 | 0.1% |
| 246.9 | 1 | 0.1% |
| 237 | 1 | 0.1% |
| 236.7 | 1 | 0.1% |
| 228 | 54 | |
| 221.4 | 1 | 0.1% |
| 221 | 2 | 0.2% |
| 220.1 | 1 | 0.1% |
| 220 | 2 | 0.2% |
| 219.7 | 1 | 0.1% |
Superplasticizer (component 5)(kg in a m^3 mixture)
Real number (ℝ≥0)
HIGH CORRELATION, ZEROS
| Distinct | 155 |
|---|---|
| Distinct (%) | 15.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.20311165 |
| Minimum | 0 |
|---|---|
| Maximum | 32.2 |
| Zeros | 379 |
| Zeros (%) | 36.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 6.35 |
| Q3 | 10.16 |
| 95-th percentile | 16.055 |
| Maximum | 32.2 |
| Range | 32.2 |
| Interquartile range (IQR) | 10.16 |
Descriptive statistics
| Standard deviation | 5.973491651 |
|---|---|
| Coefficient of variation (CV) | 0.9629830942 |
| Kurtosis | 1.413185653 |
| Mean | 6.20311165 |
| Median Absolute Deviation (MAD) | 5.31 |
| Skewness | 0.9081127315 |
| Sum | 6389.205 |
| Variance | 35.6826025 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0 | 379 | |
| 8 | 27 | 2.6% |
| 11.6 | 21 | 2.0% |
| 7 | 19 | 1.8% |
| 6 | 17 | 1.7% |
| 9 | 15 | 1.5% |
| 16.5 | 15 | 1.5% |
| 10 | 15 | 1.5% |
| 11 | 14 | 1.4% |
| 5.75 | 10 | 1.0% |
| Other values (145) | 498 |
| Value | Count | Frequency (%) |
| 0 | 379 | |
| 1.72 | 4 | 0.4% |
| 1.9 | 1 | 0.1% |
| 2 | 1 | 0.1% |
| 2.2 | 1 | 0.1% |
| 2.5 | 2 | 0.2% |
| 3 | 6 | 0.6% |
| 3.1 | 1 | 0.1% |
| 3.4 | 3 | 0.3% |
| 3.57 | 5 | 0.5% |
| Value | Count | Frequency (%) |
| 32.2 | 5 | |
| 28.2 | 5 | |
| 23.4 | 5 | |
| 22.1 | 1 | 0.1% |
| 22 | 6 | |
| 20.8 | 1 | 0.1% |
| 20 | 1 | 0.1% |
| 19 | 1 | 0.1% |
| 18.8 | 1 | 0.1% |
| 18.6 | 5 |
Coarse Aggregate (component 6)(kg in a m^3 mixture)
| Distinct | 284 |
|---|---|
| Distinct (%) | 27.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 972.9185922 |
| Minimum | 801 |
|---|---|
| Maximum | 1145 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.2 KiB |
Quantile statistics
| Minimum | 801 |
|---|---|
| 5-th percentile | 842 |
| Q1 | 932 |
| median | 968 |
| Q3 | 1029.4 |
| 95-th percentile | 1104 |
| Maximum | 1145 |
| Range | 344 |
| Interquartile range (IQR) | 97.4 |
Descriptive statistics
| Standard deviation | 77.75381809 |
|---|---|
| Coefficient of variation (CV) | 0.0799181131 |
| Kurtosis | -0.599000555 |
| Mean | 972.9185922 |
| Median Absolute Deviation (MAD) | 46.3 |
| Skewness | -0.04020640267 |
| Sum | 1002106.15 |
| Variance | 6045.656228 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 932 | 57 | 5.5% |
| 852.1 | 45 | 4.4% |
| 944.7 | 30 | 2.9% |
| 968 | 29 | 2.8% |
| 1125 | 24 | 2.3% |
| 1047 | 19 | 1.8% |
| 967 | 19 | 1.8% |
| 974 | 12 | 1.2% |
| 942 | 12 | 1.2% |
| 938 | 12 | 1.2% |
| Other values (274) | 771 |
| Value | Count | Frequency (%) |
| 801 | 4 | |
| 801.1 | 1 | 0.1% |
| 801.4 | 1 | 0.1% |
| 811 | 2 | |
| 814 | 1 | 0.1% |
| 814.1 | 1 | 0.1% |
| 817.9 | 1 | 0.1% |
| 818 | 1 | 0.1% |
| 819 | 2 | |
| 819.2 | 1 | 0.1% |
| Value | Count | Frequency (%) |
| 1145 | 1 | 0.1% |
| 1134.3 | 5 | 0.5% |
| 1130 | 1 | 0.1% |
| 1125 | 24 | |
| 1124.4 | 2 | 0.2% |
| 1120 | 2 | 0.2% |
| 1119 | 2 | 0.2% |
| 1118.8 | 2 | 0.2% |
| 1118 | 1 | 0.1% |
| 1113 | 2 | 0.2% |
Fine Aggregate (component 7)(kg in a m^3 mixture)
| Distinct | 304 |
|---|---|
| Distinct (%) | 29.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 773.5788835 |
| Minimum | 594 |
|---|---|
| Maximum | 992.6 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.2 KiB |
Quantile statistics
| Minimum | 594 |
|---|---|
| 5-th percentile | 613 |
| Q1 | 730.95 |
| median | 779.51 |
| Q3 | 824 |
| 95-th percentile | 898.068 |
| Maximum | 992.6 |
| Range | 398.6 |
| Interquartile range (IQR) | 93.05 |
Descriptive statistics
| Standard deviation | 80.1754274 |
|---|---|
| Coefficient of variation (CV) | 0.103642213 |
| Kurtosis | -0.1021647727 |
| Mean | 773.5788835 |
| Median Absolute Deviation (MAD) | 45.49 |
| Skewness | -0.2529792974 |
| Sum | 796786.25 |
| Variance | 6428.099159 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 755.8 | 30 | 2.9% |
| 594 | 30 | 2.9% |
| 670 | 23 | 2.2% |
| 613 | 22 | 2.1% |
| 801 | 16 | 1.6% |
| 746.6 | 15 | 1.5% |
| 887.1 | 15 | 1.5% |
| 845 | 14 | 1.4% |
| 712 | 14 | 1.4% |
| 750 | 12 | 1.2% |
| Other values (294) | 839 |
| Value | Count | Frequency (%) |
| 594 | 30 | |
| 605 | 5 | 0.5% |
| 611.8 | 5 | 0.5% |
| 612 | 1 | 0.1% |
| 613 | 22 | |
| 613.2 | 2 | 0.2% |
| 614 | 1 | 0.1% |
| 623 | 2 | 0.2% |
| 630 | 5 | 0.5% |
| 631 | 4 | 0.4% |
| Value | Count | Frequency (%) |
| 992.6 | 5 | |
| 945 | 4 | |
| 943.1 | 4 | |
| 942 | 4 | |
| 925.7 | 5 | |
| 905.9 | 5 | |
| 903.79 | 5 | |
| 903.59 | 5 | |
| 901.8 | 5 | |
| 900.9 | 5 |
Age (day)
| Distinct | 14 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 45.66213592 |
| Minimum | 1 |
|---|---|
| Maximum | 365 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.2 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 7 |
| median | 28 |
| Q3 | 56 |
| 95-th percentile | 180 |
| Maximum | 365 |
| Range | 364 |
| Interquartile range (IQR) | 49 |
Descriptive statistics
| Standard deviation | 63.16991158 |
|---|---|
| Coefficient of variation (CV) | 1.383419989 |
| Kurtosis | 12.16898898 |
| Mean | 45.66213592 |
| Median Absolute Deviation (MAD) | 21 |
| Skewness | 3.269177401 |
| Sum | 47032 |
| Variance | 3990.437729 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=14)
| Value | Count | Frequency (%) |
| 28 | 425 | |
| 3 | 134 | 13.0% |
| 7 | 126 | 12.2% |
| 56 | 91 | 8.8% |
| 14 | 62 | 6.0% |
| 90 | 54 | 5.2% |
| 100 | 52 | 5.0% |
| 180 | 26 | 2.5% |
| 91 | 22 | 2.1% |
| 365 | 14 | 1.4% |
| Other values (4) | 24 | 2.3% |
| Value | Count | Frequency (%) |
| 1 | 2 | 0.2% |
| 3 | 134 | 13.0% |
| 7 | 126 | 12.2% |
| 14 | 62 | 6.0% |
| 28 | 425 | |
| 56 | 91 | 8.8% |
| 90 | 54 | 5.2% |
| 91 | 22 | 2.1% |
| 100 | 52 | 5.0% |
| 120 | 3 | 0.3% |
| Value | Count | Frequency (%) |
| 365 | 14 | 1.4% |
| 360 | 6 | 0.6% |
| 270 | 13 | 1.3% |
| 180 | 26 | 2.5% |
| 120 | 3 | 0.3% |
| 100 | 52 | 5.0% |
| 91 | 22 | 2.1% |
| 90 | 54 | 5.2% |
| 56 | 91 | 8.8% |
| 28 | 425 |
Concrete compressive strength(MPa, megapascals)
Real number (ℝ≥0)
HIGH CORRELATION
| Distinct | 938 |
|---|---|
| Distinct (%) | 91.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 35.81783583 |
| Minimum | 2.331807832 |
|---|---|
| Maximum | 82.5992248 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.2 KiB |
Quantile statistics
| Minimum | 2.331807832 |
|---|---|
| 5-th percentile | 10.95942786 |
| Q1 | 23.70711515 |
| median | 34.44277358 |
| Q3 | 46.13628654 |
| 95-th percentile | 66.8045116 |
| Maximum | 82.5992248 |
| Range | 80.26741697 |
| Interquartile range (IQR) | 22.42917139 |
Descriptive statistics
| Standard deviation | 16.70567917 |
|---|---|
| Coefficient of variation (CV) | 0.4664067158 |
| Kurtosis | -0.3138436917 |
| Mean | 35.81783583 |
| Median Absolute Deviation (MAD) | 10.9281946 |
| Skewness | 0.4169222823 |
| Sum | 36892.3709 |
| Variance | 279.0797167 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 33.39821744 | 5 | 0.5% |
| 77.29715436 | 4 | 0.4% |
| 31.35047372 | 4 | 0.4% |
| 71.29871316 | 4 | 0.4% |
| 35.3011712 | 4 | 0.4% |
| 79.29663476 | 4 | 0.4% |
| 55.89581932 | 3 | 0.3% |
| 17.54026944 | 3 | 0.3% |
| 18.12632404 | 3 | 0.3% |
| 65.19685056 | 3 | 0.3% |
| Other values (928) | 993 |
| Value | Count | Frequency (%) |
| 2.331807832 | 1 | |
| 3.31982694 | 1 | |
| 4.565020596 | 1 | |
| 4.782205536 | 1 | |
| 4.827710952 | 1 | |
| 4.903553312 | 1 | |
| 6.26733684 | 1 | |
| 6.280436884 | 1 | |
| 6.46728488 | 1 | |
| 6.8085755 | 1 |
| Value | Count | Frequency (%) |
| 82.5992248 | 1 | 0.1% |
| 81.75116932 | 1 | 0.1% |
| 80.19984832 | 1 | 0.1% |
| 79.98611076 | 1 | 0.1% |
| 79.40005616 | 1 | 0.1% |
| 79.29663476 | 4 | |
| 78.80021204 | 1 | 0.1% |
| 77.29715436 | 4 | |
| 76.80073164 | 1 | 0.1% |
| 76.23536132 | 1 | 0.1% |
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation, and +1 indicating total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
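As an illustration of the rank-based calculation, here is a minimal standard-library sketch. It is not the code the profiling tool runs, and the helper names `ranks`, `pearson`, and `spearman` are hypothetical. The sample uses the five Age and strength values listed for the 362.6 kg cement mix in the "Most frequently occurring" table, where strength rises with age but not linearly.

```python
import math


def ranks(values):
    """Return 1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        average_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = average_rank
        i = j + 1
    return result


def pearson(x, y):
    """Covariance divided by the product of the standard deviations."""
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def spearman(x, y):
    """Spearman's rho: Pearson's r applied to the rank variables."""
    return pearson(ranks(x), ranks(y))


age_days = [3, 7, 28, 56, 91]
strength_mpa = [35.301, 55.896, 71.299, 77.297, 79.297]

rho = spearman(age_days, strength_mpa)
r_linear = pearson(age_days, strength_mpa)
print(rho, round(r_linear, 3))
```

Because strength increases with every step in age, ρ reaches its maximum on this sample, while the linear coefficient computed on the raw values is lower.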
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation, and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
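The invariance under changes in location and scale can be checked numerically. The sketch below is a hand-written illustration, with a hypothetical helper `pearson`. It uses the Water and Superplasticizer values from the ten "Last rows" of the sample, so the resulting coefficient describes only those ten rows, not the full dataset.

```python
import math


def pearson(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    # The 1/n factors of the covariance and the standard deviations cancel.
    return cov / math.sqrt(var_x * var_y)


# Water and Superplasticizer (kg per m^3) from the "Last rows" table.
water = [177.4, 209.7, 195.5, 168.4, 183.2, 179.6, 196.0, 192.7, 175.6, 200.6]
superplasticizer = [7.0, 11.1, 5.9, 12.2, 12.7, 8.9, 10.4, 6.1, 11.3, 8.6]

r = pearson(water, superplasticizer)

# Change the scale of one variable (kg to g) and the location of the other.
water_grams = [1000.0 * w for w in water]
shifted = [s + 50.0 for s in superplasticizer]
r_rescaled = pearson(water_grams, shifted)

print(round(r, 6), round(r_rescaled, 6))
```

Both calls return the same coefficient up to floating-point error, since the rescaling factors appear in both the numerator and the denominator.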
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation, and +1 indicating total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
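The pair-counting definition can be written out directly. This is a minimal sketch of that formula, with a hypothetical helper `kendall_tau`. It implements the plain version often called tau-a, which counts tied pairs toward neither total. Statistical libraries commonly use tie-adjusted variants such as tau-b, so results may differ when values repeat.

```python
from itertools import combinations


def kendall_tau(x, y):
    """(concordant - discordant) / total number of pairs, without tie correction."""
    concordant = 0
    discordant = 0
    pairs = list(combinations(range(len(x)), 2))
    for i, j in pairs:
        direction = (x[i] - x[j]) * (y[i] - y[j])
        if direction > 0:
            concordant += 1   # both variables move in the same direction
        elif direction < 0:
            discordant += 1   # the variables move in opposite directions
        # direction == 0 is a tie and counts toward neither total
    return (concordant - discordant) / len(pairs)


# Age and strength for the 362.6 kg cement mix in "Most frequently occurring".
age_days = [3, 7, 28, 56, 91]
strength_mpa = [35.301, 55.896, 71.299, 77.297, 79.297]
tau_sample = kendall_tau(age_days, strength_mpa)

# A toy sequence with one swapped pair: 5 concordant, 1 discordant, 6 pairs.
tau_swapped = kendall_tau([1, 2, 3, 4], [1, 3, 2, 4])

print(tau_sample, round(tau_swapped, 4))
```

Every pair in the sample is concordant, so τ is 1 there, while the single swapped pair in the toy sequence lowers τ to (5 - 1) / 6.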
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available.
A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out visual patterns in data completion.
First rows
| Cement (component 1)(kg in a m^3 mixture) | Blast Furnace Slag (component 2)(kg in a m^3 mixture) | Fly Ash (component 3)(kg in a m^3 mixture) | Water (component 4)(kg in a m^3 mixture) | Superplasticizer (component 5)(kg in a m^3 mixture) | Coarse Aggregate (component 6)(kg in a m^3 mixture) | Fine Aggregate (component 7)(kg in a m^3 mixture) | Age (day) | Concrete compressive strength(MPa, megapascals) | |
|---|---|---|---|---|---|---|---|---|---|
| 0 | 540.000 | 0.000 | 0.000 | 162.000 | 2.500 | 1040.000 | 676.000 | 28 | 79.986 |
| 1 | 540.000 | 0.000 | 0.000 | 162.000 | 2.500 | 1055.000 | 676.000 | 28 | 61.887 |
| 2 | 332.500 | 142.500 | 0.000 | 228.000 | 0.000 | 932.000 | 594.000 | 270 | 40.270 |
| 3 | 332.500 | 142.500 | 0.000 | 228.000 | 0.000 | 932.000 | 594.000 | 365 | 41.053 |
| 4 | 198.600 | 132.400 | 0.000 | 192.000 | 0.000 | 978.400 | 825.500 | 360 | 44.296 |
| 5 | 266.000 | 114.000 | 0.000 | 228.000 | 0.000 | 932.000 | 670.000 | 90 | 47.030 |
| 6 | 380.000 | 95.000 | 0.000 | 228.000 | 0.000 | 932.000 | 594.000 | 365 | 43.698 |
| 7 | 380.000 | 95.000 | 0.000 | 228.000 | 0.000 | 932.000 | 594.000 | 28 | 36.448 |
| 8 | 266.000 | 114.000 | 0.000 | 228.000 | 0.000 | 932.000 | 670.000 | 28 | 45.854 |
| 9 | 475.000 | 0.000 | 0.000 | 228.000 | 0.000 | 932.000 | 594.000 | 28 | 39.290 |
Last rows
| Cement (component 1)(kg in a m^3 mixture) | Blast Furnace Slag (component 2)(kg in a m^3 mixture) | Fly Ash (component 3)(kg in a m^3 mixture) | Water (component 4)(kg in a m^3 mixture) | Superplasticizer (component 5)(kg in a m^3 mixture) | Coarse Aggregate (component 6)(kg in a m^3 mixture) | Fine Aggregate (component 7)(kg in a m^3 mixture) | Age (day) | Concrete compressive strength(MPa, megapascals) | |
|---|---|---|---|---|---|---|---|---|---|
| 1020 | 288.400 | 121.000 | 0.000 | 177.400 | 7.000 | 907.900 | 829.500 | 28 | 42.140 |
| 1021 | 298.200 | 0.000 | 107.000 | 209.700 | 11.100 | 879.600 | 744.200 | 28 | 31.875 |
| 1022 | 264.500 | 111.000 | 86.500 | 195.500 | 5.900 | 832.600 | 790.400 | 28 | 41.542 |
| 1023 | 159.800 | 250.000 | 0.000 | 168.400 | 12.200 | 1049.300 | 688.200 | 28 | 39.456 |
| 1024 | 166.000 | 259.700 | 0.000 | 183.200 | 12.700 | 858.800 | 826.800 | 28 | 37.917 |
| 1025 | 276.400 | 116.000 | 90.300 | 179.600 | 8.900 | 870.100 | 768.300 | 28 | 44.284 |
| 1026 | 322.200 | 0.000 | 115.600 | 196.000 | 10.400 | 817.900 | 813.400 | 28 | 31.179 |
| 1027 | 148.500 | 139.400 | 108.600 | 192.700 | 6.100 | 892.400 | 780.000 | 28 | 23.697 |
| 1028 | 159.100 | 186.700 | 0.000 | 175.600 | 11.300 | 989.600 | 788.900 | 28 | 32.768 |
| 1029 | 260.900 | 100.500 | 78.300 | 200.600 | 8.600 | 864.500 | 761.500 | 28 | 32.401 |
Most frequently occurring
| Cement (component 1)(kg in a m^3 mixture) | Blast Furnace Slag (component 2)(kg in a m^3 mixture) | Fly Ash (component 3)(kg in a m^3 mixture) | Water (component 4)(kg in a m^3 mixture) | Superplasticizer (component 5)(kg in a m^3 mixture) | Coarse Aggregate (component 6)(kg in a m^3 mixture) | Fine Aggregate (component 7)(kg in a m^3 mixture) | Age (day) | Concrete compressive strength(MPa, megapascals) | # duplicates | |
|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 362.600 | 189.000 | 0.000 | 164.900 | 11.600 | 944.700 | 755.800 | 3 | 35.301 | 4 |
| 3 | 362.600 | 189.000 | 0.000 | 164.900 | 11.600 | 944.700 | 755.800 | 28 | 71.299 | 4 |
| 4 | 362.600 | 189.000 | 0.000 | 164.900 | 11.600 | 944.700 | 755.800 | 56 | 77.297 | 4 |
| 5 | 362.600 | 189.000 | 0.000 | 164.900 | 11.600 | 944.700 | 755.800 | 91 | 79.297 | 4 |
| 2 | 362.600 | 189.000 | 0.000 | 164.900 | 11.600 | 944.700 | 755.800 | 7 | 55.896 | 3 |
| 6 | 425.000 | 106.300 | 0.000 | 153.500 | 16.500 | 852.100 | 887.100 | 3 | 33.398 | 3 |
| 7 | 425.000 | 106.300 | 0.000 | 153.500 | 16.500 | 852.100 | 887.100 | 7 | 49.201 | 3 |
| 8 | 425.000 | 106.300 | 0.000 | 153.500 | 16.500 | 852.100 | 887.100 | 28 | 60.295 | 3 |
| 9 | 425.000 | 106.300 | 0.000 | 153.500 | 16.500 | 852.100 | 887.100 | 56 | 64.301 | 3 |
| 10 | 425.000 | 106.300 | 0.000 | 153.500 | 16.500 | 852.100 | 887.100 | 91 | 65.197 | 3 |